Deep Semantics in an NLP Knowledge Base
نویسندگان
چکیده
In most natural language processing systems there is no representation of the semantic knowledge of lexical units, but just subcategorization frames, selection restrictions and links to other paradigmatically-related lexical units. Some NLP systems, e.g. machine translation or dialogue-based systems, attempt to “understand” the input text by translating it into some kind of formal language-independent representation; this approach requires a knowledge base with conceptual representations which reflect the structure of human beings’ cognitive system. Even those systems in which surface semantics could be sufficient (e.g. automatic indexing or information extraction), the construction of a robust knowledge base guarantees its use in most natural language processing tasks, consolidating thus the concept of resource reuse. The objective of this paper is to highlight the advantages of storing conceptual meaning representations, and more particularly those in FunGramKB, instead of describing lexical meaning via semantic relations between lexical units.
منابع مشابه
A common type system for clinical natural language processing
UNLABELLED BACKGROUND One challenge in reusing clinical data stored in electronic medical records is that these data are heterogenous. Clinical Natural Language Processing (NLP) plays an important role in transforming information in clinical text to a standard representation that is comparable and interoperable. Information may be processed and shared when a type system specifies the allowab...
متن کاملQuestion Answering Based on Distributional Semantics
An NLP application for question answering provides an insight into computer’s understanding of human language. Many areas of NLP have recently built on deep learning and distributional semantic representation. This paper seeks to apply distributional semantic models and convolutional neural networks to the question answering task.
متن کاملURIEL and lang2vec: Representing languages as typological, geographical, and phylogenetic vectors
We introduce the URIEL knowledge base for massively multilingual NLP and the lang2vec utility, which provides information-rich vector identifications of languages drawn from typological, geographical, and phylogenetic databases that are normalized to have straightforward and consistent formats, naming, and semantics. The goal of URIEL and lang2vec is to enable multilingual NLP, especially on le...
متن کاملCognitive Modules of an NLP Knowledge Base for Language Understanding
Some natural language processing systems, e.g. machine translation, require a knowledge base with conceptual representations reflecting the structure of human beings’ cognitive system. In some other systems, e.g. automatic indexing or information extraction, surface semantics could be sufficient, but the construction of a robust knowledge base guarantees its use in most natural language process...
متن کاملThe Architecture of FunGramKB
Natural language understanding systems require a knowledge base provided with conceptual representations reflecting the structure of human beings’ cognitive system. Although surface semantics can be sufficient in some other systems, the construction of a robust knowledge base guarantees its use in most natural language processing applications, consolidating thus the concept of resource reuse. I...
متن کامل